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Abstract 

We consider a version of the forest fire model on graph G, where each vertex of a graph 
becomes occupied with rate one. A fixed vertex vq is hit by lightning with the same rate, 
and when this occurs, the whole cluster of occupied vertices containing vo is burnt out. We 
show that when G = Z+, the times between consecutive burnouts at vertex n, divided by 
log n, converge weakly as n — > oo to a random variable which distribution is 1 — p(x) where 
p{x) is the Dickman function. 

We also show that on transitive graphs with a non-trivial site percolation threshold and 
one infinite cluster at most, the distributions of the time till the first burnout of any vertex 
have exponential tails. 

Finally, we give an elementary proof of an interesting limit: lim n _^ 00 Ylk=i (&) ( — ■"■)* 1°§ k~ 
log log n = 7. 

Keywords: forest fire model, percolation, Dickman function, Stirling numbers. 
Subject classification: primary 60G55, 60K35; secondary 60F05. 

1 Introduction and results 

Consider the following forest fire model on Z + = {0, 1, 2, . . . }. Let rj x (t) G {0, 1} be the state of 
site x G Z + at time t > 0, and we say that site x is vacant if r] x = and occupied, if r\ x — 1. The 
vacant sites become occupied with rate 1; once they are occupied, they can only be "burnt" by 
a fire spread from a neighbour, which reverses them to the original vacant state. Imagine that 
there is a constant source of fire attached to site 0. Hence, whenever site becomes occupied, 
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the whole connected cluster of occupied sites containing is instantaneously burnt out. Denote 
the process we obtain as {r) x (t)}. We are interested in the dynamics of process {fj x (t)}, as time 
passes by, under the assumption that all sites are initially vacant, i.e. f] x {0) = for all x. 

Note that our model differs from more classical versions presented in p[] and [5], where each 
occupied site can be ignited at rate A, and then the cluster containing this site disappears. On 
the other hand, our model on Z + turns out to be a special case of the one studied in [5], where 
some of the results, independently obtained in the present paper, are also given. This covers, 
for example, the recursion (j2J), and also most of Lemma El but none of the limiting statements 
derived in Theorems [5] and El Forest fire models have also been recently considered on Erdos- 
Renyi random graphs, see [19|. 

Let T x (i), i — 1, 2, . . . , be the consecutive times when site x is burnt for the z-th time, and let 
T x (0) = 0. Let T n (i) = T n (i) — T n (i — 1) for i > 1. We can easily show that for a fixed n, r n (i)'s 
are i.i.d. random variables; this can be done by induction on n. Indeed, the times of burnouts at 
(n + 1) depend only on T n 's and the Poisson arrival process at site (n + 1) itself. Since for each j 
necessarily T n+ i(j) = T n {i) for some i, T n+ i(j)'s are renewal times, and hence r n+ i(j)'s are i.i.d. 
as well. 

Now we would like to find the distribution of r n+1 (i)'s. For site this is trivial as the burn-out 
times constitute a Poisson process, so that 

P(r > u ) = e~ u , u>0. 
Reasonably easy one can also obtain 

P(n > u) = (u + l)e~ u , 
so that n has T(2, 1) distribution with density ue~ u \ similarly 

P( T2 > u) = (2"' + 10" + 7)e- + e-» 

From the above calculations we conclude that 

E(r ) = 1, E(n) = 2, E(r 2 ) = 8/3; 
Var(r ) = 1, Var(n) = 2, Var(r 2 ) = 8/3. 

Incidentally, this suggests that Var (r n ) = Er n , which, however, turns out to be incorrect, as 
follows from Remark [31 

For a general n, let <p n {t) = Ee' Tn be the moment generating function of random variable 
r n . Suppose that sites n and n + 1 have just been burnt, and without loss of generality reset 
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the time to t — 0. Let i] ~ exp(l) be the time till the next Poisson arrival at site (n + 1). The 
next burnout at site (n + 1) will be either at time t = r n if rj < r n , or at a later time otherwise; 
in the latter case due to the memoryless property of the Poisson process the time between r n 
and the next burnout at (n + 1), denoted by f n+ i, will have the same distribution as r n+ i itself. 
Therefore, given r n , 

0, if V < T n, 

f n+ i, if r] > r n . 



r n +i = T n + 
Consequently, 



E (e tTn+1 | T n ) = e tTn [(1 - e- Tn ) ■ 1 + e~ T " • E (e tfn+1 | r n )] 

using the fact that f n+1 is independent of r n . Taking the expectation on both sides, we obtain 

(p n +i(t) = (p n (t) - (p n (t - 1) + (p n (t - l)(p n+1 (t) 

whence 



Let u n (t) — (p n (t) — 1. Then 



y? TO (t) - y? TO (t - 1) 

1 - lf n {t - 1) 



Since <fo(t) = e tx e x dx = 1/(1 — t) yielding u (t) = t/(l — t), we can easily iteratively 
compute u n {t). For example, 

/x t(2-t) , N t(2 - 1) 3 



(i-tr zw (i-tm-t) 

which is consistent with our previous calculations of the distributions of r x and r 2 . 
Lemma 1 For n = 1,2, ... 

t.(2-i)(S) -(4 -*)(") -(6-t 



u n -i(t) = 



(1 - ■ (3 - • (5 - . . . 

with the convention that (™) = whenever k > n. Thus r n is a mixture of Gamma random 
variables with the moment generating function 

n 

y>„-i(*) = i + tH(k-t)^ k ^ 

k=l 

defined for all t < 1. 
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Proof. By induction, using © and the fact that (™) + ( fe "J = 
Lemma 2 Let /i n = Er n . T/ien /or n— 1,2, ... 

log/i n _i = ^ ( . J(-l) l logi; 

i=i 

E ( eo = £(»)<=£ 

i=l \ ' 



Moreover, 



lim log ( ) = 7 

log 71, 



where 7 = 0.577... is the Euler constant. 

The following two lemmas will be proved in Section [HJ 
Lemma 3 Let 

then lirn n _ ! . 00 (A„ — log log n) = 7, where 7 = 0.577... is the Euler constant. 



Remark 1 After this paper has been written, we learned (Bdlint Toth, personal communications) 
that the above limit is in fact derived in |HJ/ ; Theorem 4, though no explicit proof was given there. 
Thus we shall give a reasonably short and elementary self-contained proof of this convergence. 



Lemma 4 Let 

'n\ (-l) k+i 

k=l 



k k r 



Then 



(a) a(n,m)= ^ 



ziz 2 ... i r 

l<j 1 <i2<---<i m <n 



{h) a(n, m) < (logra + l) m for all n > 1; 



log m n 

(c) a(n, m) = - h 0(log m_1 n) for a fixed m as n — > 00. 

m! 
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Remark 2 The quantities a(n,m) are closely related to the Stirling numbers of the second kind: 

i£ g) (-u-'*», 

! — n \ / 



n 

m I a' 



and up to a coefficient of proportionality coincide with "negative-positive" Stirling numbers in 
|7|/ ; see equations (68) and (78) there. 

Proof of Lemma \2i The first part follows immediately from Lemma [T] and the properties of 
moment-generating functions; the second part follows from Lemma [3j ■ 

Remark 3 Lemma [H together with part (c) of Lemma^ yield that 



lim E ' ' " 
n-+oo \ logn 

lim E ' " 
n-»oc Vlogn 



1.7$ 



2e 7 « 3.56 . . . 



whence for large n, Var (r n ) ^Er„. 

Theorem 1 Let £ n = r„/logn. T/ien as n oo 

(meaning convergence in distribution) where £ is a random variable with mean E£ = 7', and £/ie 
moment generating function 



Ee^ = l + exp{Ei(s)}. 



(3) 



Here 7' = e 7 = 1.781... and 

Ei( S ) 



— dx = 7 + log s + 

a; ^— ' 



m=l 



m ■ ml 



is the exponential integral (understood in terms of the Cauchy principal value; see fl], Section 
5.1 and formula 5.1.10). 



Proof. Observe that 

"«n-l(*) 



log 



k=l 



1 -* 



00 J.771 

m=l 



E 

.fc=i 



?? 



\fc-i 



E 

m=l 



a(n, rn) t" 



m 
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where 



a(n, m) = 



k=l 



71 



-1) 



k+1 



k k r 



For the moment generating function of £ n _i we have 



log(E e 



1) = logM„_i(s/logn) 



log s + loi 



logra 



+ E 



a(n, m) s m 
' log" 1 n m 



consequently for any N > 1, using part (b) of Lemma HI 



An-l(s) 



logfEe^"- 1 - 1) - Ei(s)| 

oo 

log(E e 3 ^- 1 - 1) - 7- logs - V" 



< 



+ 



lot 



Pn-l 

logn 



7 



A' 



E 



m • ml 



+ E 

m=l 
00 

+ E 



a(n, mj 



log m n 



1 



m=l 
1 

m! 

1 



m ■ ml 

m 



logn 



s 

m 



m=N+l m=N+l 

Fix an e > 0. Assuming \s\ < 1/2, we can choose N so large that the last two summands are 
smaller than e/2 each. Now for a fixed N by Lemma [5] and part (c) of Lemma H] the first two 
terms of the RHS of A n _i(s) go to as n — > 00. Consequently, limsup^^ A n (s) < e. Since e 
is arbitrary, we conclude that for \s\ < 1/2 

lim Ee^"" 1 = 1 + exp {Ei(s)} . 

n^oo 

By Theorem 3 in [9], if the sequence of moment-generating functions corresponding to random 
variables £ ra converges point-wise to a limit function ip^(s) on some interval around 0, then there 
is a random variable £ such that £ n — > £ in distribution and y^(s) is its moment generating 
function. This finishes the proof. ■ 

Theorem 2 Random variable £ defined in Theorem [1\ has the density function f(x) and the 
survival function p(x) = P(£ > x) satisfying 



/(*) 



and 



p(x) 
xp'(x) 

so that p(x) is the Dickman function. 



-- 0, x < 1; 

= -f(x - 1), x > 1, 

1, x < 1; 

— p(a; — 1), x > 1, 



(4) 



(5) 
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Proof. Let us denote by ip{t) = Ke lt ^ = f^(it), then we have 

ty\t) = ip{t)e il - e il . 

Using formally the inversion formula and the fact that for a random variable Y = 1, Ee*' y = e lt , 
we have 

(where 8 X denotes the Dirac delta-function.) Using integration by parts on the left (again, 
formally) we have 



ix^ J ty(t)e- itx dt = -x-^- J ^(t)e~ itx dt 



= f(x-l) + f(x)-S x ^ 

yielding (xf(x))' = 8 x -i — f(x — 1). Integrating this equality from — oo to x, and denoting 
F(x) = P(£ < x), we obtain 

xF'(x) = l x >x - F(x - 1) 

implying p(x) = 1 — F(x) satisfies xp'(x) = —p(x — 1) for x > 1 as required. 

To prove the above results rigorously, first observe that the Dickman function p(u) has the 
following properties: (1) it is positive and decreasing on the [1, oo); (2) it is infinitely different iable 
on [0, oo] except at integer points; (3) p{u) < \/Y{u + l) for u > 1 (see e.g. [22] for its properties). 
Consequently, 

0, u < 1; 

1 — p(u), u > 1 

is the cumulative distribution function of some continuous random variable ( which density is 
supported on [1, oo). Multiplying the second equation in ([3]) by te tx and integrating, we obtain 

/oo poo 
txF'(x)e tx dx = J (1- F(x - l))te tx dx (6) 

Integrating by parts the RHS of (jEj), we have 

POO 

t lim p(x - l)e tx - [1 - F(0)]e* + / e tx F'(x - 1) da; 

x^oo J x 

POO 

= - e l + e* / e tx F\y) dy = e\ n (t) - 1) 
Jo 



F(u) :-- 



where y^(£) = Ee^ is the moment generating function of (. On the other hand, the LHS of 
equals 



d f°° 

t — J F'(x)e tx dx = t<p' ( (t). 



This yields tip' At) = e*(^(i) — 1) and <^(0) = 1, a general solution to which has a form 



(fc,{t) = 1 + CiS exp ) 



\m=l 



m ■ ml 



for some constant C\. To identify C\, we will use the fact that (p c (-z) = Ee" zC -> as z — > oo 
(as £ > 0). Using Taylor expansion for e* we obtain 

1 - e~* 



= 1 — Ci^exp 



dt . 



Now, formulas 5.1.1 and 5.1.39 in pQ for the function E\(z) give 



1 - e 



oo _t 
^(z) = j^dt, 

z 

dt = E x {z) +log^ + 7 



yielding 



(fc(-z) = 1 - Ci exp -7 



oo e _ t 



dt . 



(7) 



Since the integral goes to as z — > oo, we conclude that Ci = e 7 . Thus coincides with the 
expression given by (J3]) and by the uniqueness theorem, £ must have the same distribution as £, 
from which the Theorem follows. ■ 



Here are a few observations about the distribution of £. Trivially we have .F(x) 
thus using (jSJ) for < a; < 1 we have F'(x) = whence 

= 0,0 < x < 1 

as well. Consequently, for 1 < x < 2, we have xF'(x) = 1 so that 

F(a;) = logx, 1 < x < 2. 
Therefore, by induction we can obtain piece-wise smooth density function of £: 



for x < 0; 



0. 

l/x, 

l-lo g ( x -l) 



x < 1; 

1 < ac < 2; 

2 < x < 3; 



Unfortunately, there is no explicit formula in elementary functions for f(x) on an interval [n, n+l] 
for n > 2. 

Our next statement deals with residual waiting times for the renewal process generated by 
consecutive burnouts at site n. 

Proposition 1 Let r] t n be the time till the next burnout at site n after time t > 0. Then 
T] tt n/ \ogn — >fj n as t — > oo and fj n — >fj as n — > oo ; where fj has a generalized Dickman distribu- 
tion GD(1), see J77| / ; i.e. the same distribution as U\ + U1II2 + U1U2U3 + . . . with Ui being i.i.d. 
uniform [0, 1] random variables. 



Proof. As we already know, the times between consecutive burnouts r n ' 



(0 



1,2, 



are i.i.d. 



and have a common distribution of r n . Let = ri /logn, and let F n (-) be the common 
cumulative distribution function of £n\ which is the same as for the random variable £ n defined 
in Theorem [TJ As it is well-known, see e.g. [TU], Chapter 3.4, the residual waiting times for the 
renewal process generated by converge in distribution to a non-negative random variable fj n 
such that 



< x) 



[1-F n (u))du, for all x > 0. 



(We need to verify that the distribution F n is non-arithmetic, however this easily follows from 
the fact that r n is a continuous random variable, which is a mixture of Gamma distributions, as 
implied by Lemma [TJ) 

Let F be the cumulative distribution function of £, as defined in the proof of Theorem [21 and 
77 be a non-negative random variable such that 



< x) 



— — / (1 — F(u)) du = e -7 / p{u) du, for all x > 0. 
^( Jo Jo 



Then, since /^(l — F n {u)) du < / °°(1 — F n (u)) du = E£ n , 



mfjn < x) -¥(fj < x)\ < 



±J\ Fn (u)-F(u))du 

[ (1-F n (u))du 
Jo 



< 



\FJu)-F(u)\ du + 



- 1 







where the first summand tends to by the dominated convergence theorem since F n (x) — > F(x) 
pointwise by Theorems [TJ and [21 and the second one vanishes because of Lemma [21 Therefore 



v _ 

Vn >V- 
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We finish the proof by noting that the distribution in (jSJ) coincides with the distribution of 
E"iir i= i^.«e|8l- ■ 

We conclude by noting that similar distributions (called Dickman-type distributions) show 
up in some other probabilistic models, including e.g. minimal directed spanning trees as well as 
number-theory related problems, see [T7j and references therein. Another interesting application 
is in economics, related to plot-size distributions: see [TT], formula (4), which is identical to that 
for fj. 

2 Generalizations 

One can consider a similar forest fire model on an arbitrary connected locally- finite graph G 
with the vertex set V(G) and one special vertex vq G V(G) which is called the origin. Let 
Vx(t) G {0, 1} be the state of site x G V(G) at time t > 0; again the site x is vacant (occupied 
resp.) if r] x = (r) x = 1 resp.). Vacant sites become occupied at rate 1; they remain occupied 
until they are burnt out, which makes them vacant again. For definiteness, at time all sites 
are vacant. As before, only site vq is constantly hit by lightning, hence whenever it becomes 
occupied all the sites in the cluster of occupied sites containing v are instantaneously burnt out. 

Unfortunately, this model turns out to be not so interesting, provided that the critical per- 
colation threshold p c for site percolation on G is strictly smaller than 1, which is true on many 
graphs. Recall that if Vo (p) = 6(p) denotes the probability that site vq belongs to an infinite 
cluster of occupied sites given that each site is independently occupied with probability p, then 
the critical percolation threshold is defined by 

p c = sup{p : 9(p) = 0} 

(see for example [13]). 

We claim that if p c < 1, then in our forest fire model infinitely many sites can be burnt in a 
finite time. Indeed, fix a p G (p c , 1), and let 

S = S(p) = -\og(l-p). (9) 

Then with probability at least 6(p) > site vq becomes occupied in time exceeding S (at 
which point it is immediately burnt), and by that time there will be already an infinite cluster 
attached to vq, so that it will burn some arbitrarily far away vertices. 

As it is well-known, on many graphs (Z d , d > 2, regular trees, some others) the number of 
infinite occupied clusters can be either 0, 1, or oo (see [IS]); also it is known that on Z d , d > 2, 
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and some infinite Cayley graphs (but not a regular tree) the infinite cluster, whenever present, 
must be unique; see [BJ, [T5] , and also Chapter 8.9 in [13] and Theorem 4 in Chapter 5.1 in [2J. 
Additionally, suppose that the graph is transitive, that is to say that graph G viewed from any 
vertex v G V(G) is isomorphic to graph G viewed from vq\ this in turn would imply using the 
FKG inequality for the connectivity function ([13]. Chapter 8.5) that the probability that an 
arbitrary chosen vertex v is burnt out in time S exceeds (1 — 7)(1 — p) where 

T :=l--^-G 0,1 . 10 

p 

We can generalize this argument as follows. 

Theorem 3 Suppose that graph G is connected, transitive, the critical point for the site perco- 
lation p c = p c {G) < 1 and that there can be at most one infinite cluster on G. Fix an arbitrary 
v G V(G) and let rj be the time till its first burnout in our forest fire model. Then for any 
p G (p c , 1) and j = 0, 1, 2, . . . 

P (V > x ) < l~ l b(l - P) + 1] e ~ Xx f° r allx>0 (11) 
where 7 is given by [W\) . and A = A (7) > is the smallest positive solution of 

V (X)= 1 - 1 (12) 

r 1 _1 

with ip(t) = 1 — (!_p)i-t = [1 — ie 5 ^ 1- *- 1 ] -1 and S being defined by 
Remark 4 The function ip(t) satisfies the following properties: 

• <p(o) = i; 

• ip(t) is positive and finite on [0,t max ) where t max = t max (S) is the smallest positive solution 
of I = te s{1 - l) , that is 



' max (S) 



1, for S<1, 

-LambertW(-5e- 5 )/5 for S > 1 



where LambertW is the Lambert W function; 

• *max < owe? /ience <£>'(£) oc (1 — tS 1 ) > for t < t max (^easy to check); 

• <p(t) t +OO as t t tmax- 

Therefore, the solution to |Hj) indeed exists for any < 7 < 1 . 
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Proof of Theorem As we have already established, the probability that an arbitrary vertex 
v is burnt out in time S is at least (fTUj) ; this would be obviously also true even if some of the 
vertices v G V(G) \ {vq} were already occupied at time 0. Denote by T(1),T(2), . . . the times 
of ignitions of vertex Vq, set T(0) = and let r(n) = T(n) — T(n — 1) be the (exponentially(l) 
distributed) times between consecutive burnouts. Let N = N(x) be the number of intervals r(i) 
of length at least S entirely lying inside [0,x], that is 

N(x) = card{i : T; > S, T; < x} 

To get a handle on N(x), we will use the renewal theory approach. Let 

i = 0, 

i k = min{i > i k _ t : t { > S, r,- < S V j G fe = 1, 2, 3, . . . 

Then T(ifc) form a renewal process, and 

N(x) = max{A; : T{i k ) < x} = max{A; : v\ + + • • • + v k < x} 

where v k '■= T(i k+ i) — T{i k ) are i.i.d. random variables, and if ip u (t) = tp(t) denotes its mo- 
ment generating function which we will need later, then, by conditioning on n and using the 
memoryless property, we obtain 

^t) = Ee^^E^l^] +E [e tT ^)l T1>5 ] 

rS poo 

— E e tu+T(il) e- u du+ / e tu e- u du 

Jo Js 

- — [(1 - e-W) <p{t) + e~^ s 



l-t 



yielding 



<p{t) 



1 - te s ( 1 -*) 



which is defined for all t < t max . In particular, = <p'(0) = e s , and thus we expect N(x) to be 
typically around xe~ s = x(l — p). 

On the other hand, by the arguments preceding the statement of the Theorem, conditioned 
on N(x), the probability that v has not been burnt out in time x is smaller than j N ( x \ hence 

oo 

F(»7 > x) < E 7 Ar(x) = ^ 7 n P(A^(x) = n). 

n=0 
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n=0 



We split the sum above into two parts and estimate it as follows: 

oo [x(l-p)\-l oo 

^2j n F(N{x) =n) < l n ^{N(x)=n)+ ^ l n ^{N{x) = n) 

n=0 n=[x(l— p)J 

Lx(i- P )j-i 

< J n W(N(x) <n)+ 7 ^( 1 -p)Jp (N(x) > [x(l - p)J ) 

n=0 
[x(l-p)}-l 

< Yl I n n^ + --- + ^n+l>x)+ 1 xil -^- 1 (13) 
n=0 



< 



7 



ar(l - p) max 7 m PK + • • • + v m > x) + 7 x ( 1 ~ p ) 

m6{l,...,b(l-p)J} 



From Markov inequality, we have for any t > 

7 m P(z/i H h z/ m > x) < e A( *' m) where A(t, to) = to log 7 + to log <£>(£) - tx. 

We will bound log [7 m P(z/x + • • • + v m > x)] by max < m < I ( 1 _ p ) min t>0 A(t, to). From well-known 
properties of the MGF we know that logip(t) and hence A(t,m) is convex in t, therefore the 
latter achieves a unique minimum at point t* = t*(x/m) where t*(a) solves the equation 

Also, for to < x(l—p) we have t* > as dA(t, m)/dt \ t=o = rrup'(0)/<p(0)— x = m(l—p)~ 1 —x < 0, 
yielding min 4 > A(t, to) = A(t*(x/m), to). Additionally, t* (jz^) = 0, t*(a) is increasing in a as 
dlog <£>(£)/ dt is increasing, and it is easy to check in our case (p(t*(a)) — ► oo as a — > oo. 
On the other hand, 



dA(t*(x/m), m) 
dm 



log 7 + log<^(t*) + 



TO ; — : X 



dt*(x/m) 
dm 



log 



IV [t* 



x 



The RHS of this expression decays in to; moreover as to j 0, — — > +oo resulting in (p(t*(x/m)) 



+oo and 



dA(i*(a;/m),m) 



dA(t* (x/m),m) 



dm 



dm 



m=x(l—p) 



m=0 



+oo. At the same time, for m = x(l—p) we have t*(x/m) = hence 



log 7 < 0. Therefore, the maximum of A(t*(x/m),m) is achieved at some 



dA(t* (x / m) ,m) 



intermediate m and this maximum equals —Ax where A = t*(x/m) solves ([ 
equation (fT2l) . Finally, observe that 



0, i.e. 



7' 



x(l-p) 



exp{A(0,x(l -p))} = exp{A(f ((1 -p)" i ),x(l -p))} 



< exp <| max A(t*(x/m),m) 

0<m<x(l—p) 



- A.c 



Now (Pj) yields ([IT]). 



13 



3 Proofs of the combinatorial results 



Proof of Lemma Observe that 
A n -i 



E 

i=2 
n 

E(-d 



l)'logi = J] 

i=2 



1=2 



n 
i + 1 



-iy 

n 
i + 2 



i 2 i 3 * 
log - + log - H h log - 

1 2 z — 1 



± 



log 



i - 1 



i=2 



n-1 



v 7 fe=i 



fc _i Ai — 1\ _ fc + 1 
- ]/ Mog 



fc 



fc ' 



hence 



A„ = B-D'-'g) log 



fc + 1 
fc ' 



To estimate the above quantity, we use the partial fractions method the way it is employed in 
[20] . equation (8), and in [21], Example 5.8, 



nl 



x(x + 1) . . . (x + n) 



i-y^V-i)*- 1 — . 

x ^ Vfc/ X + fc 

fc=l v 7 



Consequently, 



n! 



dx. 



x x(x + 1) . . . (x + n) 
(In fact, there is yet another formula for A n in [18], 5.5.1, saying that 



(14) 



£<-*(:) 

fc=i v 7 



log 



fc + a 
fc + 6 



-log~+ / (^-^(l-t) 



df 



hence log A n = lim H0 log^ 1 - /^(l - t 6_1 )(l - t) n (logt)- 1 dt 
estimate this limit and hence decided to work directly with ffl4|) . 
Let us rearrange (TH|) as follows: 

-i 



log* 

Unfortunately, we could not 



-A r . 



1 



1 



_x x(l + x/l)(l + x/2) . . . (1 + x/n) 
Using standard Taylor series expansion for |x| < 1 we have 



dx 



oo m 

log (1 + x ) 1 + I)... 1 + £ =£^.(-1)-*. 



m=l 
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where H n ra = ^fc=i ^ m are the generalized harmonic numbers. Moreover 



H n = H njl = 7 + logn+ — + 0(n 



-2\ 



2 n 



(15) 



CM - 7 ry^rr + 0(^ m ), m = 2, 3, 

(m — l)n m 1 



with ((s) = ^2T = i k~ s being the Riemann zeta function and 7 = 0.577... the Euler constant. The 
first equality in (1151) follows from the asymptotic for the Digamma function ip(x) = dlo ^^- ) (see 
6.3.2 and 6.3.18 in pQ) while the second one is an elementary consequence of the fact that 

^ . rr ^ 1 f°° dx 1 f°° dx 

C(m) - H n , m = ^ — , while / — < ^ — < / — . 

fc=n+l ^ X fc=n+l K Jn X 

By changing the variables x = yj logn in the integral, we obtain 



^4* 



logn^ 

y 



1 - 



exp(B n (y)) 



where 



B »<^ + i^(^ + ("- 2 >) 



- T iL^(c(2)-- + 0( n - 2 )] 

z(logn)' 2 \ n 



Integrating separately on [0, 1] and [l,logn] we obtain 



^4* 



/•logn dy _ f l 
Jl v Jl 



logn g-Bniv) dy 
V 



Jo f [1 - 



log logn - JJ* 

log log ?2 + 



/0 j/ 



7 



+ o(l), 



by plugging z — 1 into (J7J), taking into account that for y e [0, 1] 



1 - e- B " fe) = (1 - e^) + + i/x 0((logn)- 2 ) 

log n 



and at the same time 



m e -B n (y) dy 
1 1/ 

logn e -B n {y) 



+ 



\/logn 



00 e"* dy 



< 



1 y 

log n 1/ 



Vlog n I -B n (y) _ p -y 



e~ y \ dy 



=: (I) + (II) + (III), 
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where 



(I) < 
(II) = 



(III) < 



v/T °s™ Const e-v dy < Const 



Vlogn y ^log n ' 



dx 



< 



(logn.) 



y h x(l + x/1) . . . (1 + x/n) J ilogn) -h x (1 +x (1 + I + • • ■ + i)) 



log 



a; 



l + #« 



log 



1 



(logn) 2 



e v dy 



H n + y/Eg 

#n + l 



/7 



v / logri 



(1 + 0(1)), 



Vlogra 



Proof of Lemma [H The derivation of (a) is fairly straightforward by induction; it can also be 
recovered from Section 4 in [7j. 
To establish (b), note that 

n n n / " i \ m 



Zl« 2 ...«m \ , 1 

tl=l »2 = 1 «m = l \l=l 



and H n < 1 + log n for n > 1 . 
Finally, to prove (c), let 



a(n, m) := 



ZlZ 2 ...Z m 

1<11<12<— <« m <n 

(observe that here all z&'s must be distinct). From equation (3.2) in [Hj it follows that for a fixed 
m satisfy 

(logn)™ , 7(log") m " 1 , (7 2 -C(2))log m ~ 2 n 

a(n, m) = : — + — — - + — — + . . . 

ml (m — 1)1 (m — 2)! 2 

On the other hand, 

m— 1 

< a(n, m) — a(n, m) = 

r=l l<ii 

m— 1 / n ^ \ 

( 13 ) ^ 

+2< 

< 2ma(n,m - 2) < 2m(logn+ l) m ~ 2 



ZxZ 2 • • • V 

r=l l<jl<j2<---<«r=ir+l<ir+2<---<«m<n 



k 2 / ^— ' Zli 2 ■ ■ - if 

r=l \k=l / l<ii<---<j r _i<i r +2<---<i m <"- 
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Therefore, 

f \ (1( 

nt n ml = — 

m\ (m — 1)! 

similar to a(n, m). 



. (lognp 7 (logn) m - 1 m _ 2 . 

a(n, m) = v & / + ') & \ + 0(log m 2 n) 
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